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1 Introduction 

Let D be a matrix whose rows and columns are indexed by a set X. We 
assume that D is symmetric and has zero entries on the main diagonal. In 
phylogenetics, these kind of matrices are called dissimilarity matrices . Usually, 
we take X = [n] :— {1,2, ... ,n}. Hence a dissimilarity matrix D can also be 
seen as a map D : [n] 2 — > R, with D(i,j) = D(j,i) and D(i,i) = for each 
i,j e [n]. 

A metric is a non-negative dissimilarity matrix which satisfies the triangle 
inequality D(i,j) < D(i, k) + D(k,j) for all i,j, k € X. 

We say that D has a graph realization if there is a weighted graph (so a non- 
negative weight is assigned to each edge) whose node set contains X and such 
that the distance (i.e. the length of the shortest path) between nodes i,j € X 
is exactly D(i,j). A distance matrix is a non- negative dissimilarity matrix that 
has a graph realization. In [3l H] , one can find some results on these kind of 
matrices. 

In the case the graph is a tree and X corresponds to the set of leaves, 
D is called a tree metric. This case has been studied intensively and is well 
understood. The main result is the following (see [2] or [51 Theorem 2.36]). 

Theorem 1.1 (Tree Metric Theorem). Let D be a non-negative dissimilarity 
matrix on [n] . Then D is a tree metric on [n] if and only if, for every four ( not 
necessarily distinct) elements i,j,k,l G [n], the maximum of the three numbers 
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D(i,j) +D(k, I), D(i, k) + D(j, I) and D(i, I) + D(j, k) is attained at least twice. 
Moreover, the tree T with leaves [n] that realizes D is unique. 

The condition of the theorem is called the four-point condition. It is a 
necessary and sufficient condition on a matrix to be realized by a tree. 

Tree metrics on n leaves are parameterized by the space of trees T n C 
The following result gives us a description of T n (see [T]). 

Theorem 1.2. The space of trees T n is the union of (2n— 5)!! = 1.3.5 . . . .(2n— 5) 

orthants isomorphic to K>q~ 3 . More precisely, T n is a simplicial fan of pure 

dimension 2n — 3 in 

We can consider a generalisation of the concept of dissimilarity matrix. Let 
m < n be an integer. A map D : [n] m — > M is called an m- dissimilarity map if 

D(ii, . . . ,im) = D(i„(i), . . . ,i„( m )) 

for all permutations tt G S m and D(ii, i%, . . . , i m ) = if the numbers i±, . . . , i m 
are not pairwise distinct. 

We say that D is realized by a tree T if the leaf set of T is [n] and if for 
each m-subset V = {ii, . . . ,i m } C [n], the weight of the smallest subtree of T 
containing V is equal to D{i\, . . . , i m ). An important result on m-dissimilarity 
maps of trees is given in [5]. 

Theorem 1.3. Let T be a tree with n leaves and no vertices of degree 2. Let 
m > 3 be an integer. If n > 2m — 1, then T is uniquely determined by its 
m-dissimilarity map D. If n = 2m — 2, this is not true. 

In this paper, we give a description of a map 4>( m > : sending 
the distance matrix of a tree T to its corresponding m-dissimilarity map (see 
Theorem 13.21 in Section [3]). In Section [U we investigate the case m = 3. In 
particular, we show that 4>^{%i) IS equal to the intersection of the tropical 
Grassmannian Qs, n with a linear space (see Theorem l4.6p . In Section^ we give 
a partial result on the case m — 4. An introduction to tropical gemetry is given 
in Section [2j 

To finish this section, we describe the relation with Phylogenetics. A classical 
problem in computational biology is to construct a phylogenetic tree from a 
sequence alignment of n species 

Species 1 ACAATGTCATTAGCGATACGTAGGTACGATGC . . . 

Species 2 ACGTTGTCAATAGAGATTTTGGATGAACGATA . . . 

Species 3 ACGTAGTCATTACACATTCTGGATTAACGTTA . . . 

Species 4 GCACAGTCAGTAGAAGCTATGGTACATCGATC . . . 



Species n GAACTGTCAGTAGAAGCGAGTGTACATTCGTT. . . 

The main technique to select a tree model is computing the maximum like- 
lihood estimate (MLE) for each of the (2n — 5)!! trees. Unluckily, all the MLE 
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computations are very difficult, even for a single tree, and this approach requires 
examining all exponentially many trees. 

A popular way to avoid this problem is the so-called distance based approach, 
where one collapses the data to a dissimilarity matrix and obtains a tree via a 
projection onto tree space T n (by using the neighbor-joining algorithm). In fact, 
for such sequence data, computational biologists infer the distance between any 
two taxa. Thus, an interesting problem of phylogenetics concerns the construc- 
tion of a weighted tree which represents this distance matrix, provided such a 
tree exists. 

More general, we may think of an m-dissimilarity map as a measure of how 
dissimilar each subset of m species is. As a generalization of the previous prob- 
lem, we can search for a weighted tree such that the m-subtree weights represent 
the entries of the m-dissimilarity map. This problem has some natural relevance 
in Phylogenetics. Indeed, for example, it can be more reliable statistically to 
estimate the triple weights D(i,j, k) rather than the pairwise distances D(i,j) 

(0, M)- 

2 Tropical geometry 

The tropical semiring (R U {— oo},(B,<g>) is the set of real numbers completed 
with — oo, equiped with two binary operations: the tropical sum is the maximum 
of two numbers and the tropical multiplication is the ordinary sum. 

Tropical monomials ij 1 • ■ • xt* represent ordinary linear forms 53i=i a i x i 
and tropical polynomials 

(§\ a ®x a 1 1 ®---®x a k k , (1) 

aeA 

with icN' finite and A a £ K, represent piecewise-linear convex functions 

fe 

F : M. k -> K : (x 1: . . . ,x k ) i-> max{A Q + y^a i x l }. (2) 

1=1 

Now let K be the field of Puiseux series, i.e. the field of formal power series 
a = J2 q eQ a qt q m the variable t such that the set Q a = {q G Q | a q ^ 0} is 
bounded below and has a finte set of denominators. For such an a, the infimum 
of Q a is equal to the minimum and we call it the valuation val(a) of a. 

A polynomial 

/On, • • • , x k ) = 9a(t)x a ^ ■ ■ ■ x a k * e K[X] 

gives rise to the tropical polynomial in |T]), where A„ = — val(g a (t)). We denote 
this tropical polynomial by trop(/). 

We define the tropical hypersurface T{F) = T(trop(/)) as the corner locus 
of the function F in @, i.e. the set of x — {x\, . . . ,Xk) G K fc such that the 
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maximum of the collection of numbers 

< ^2 a i x i + } 

I *=1 ) a£A 

is attained at least twice. 

Theorem 2.1. If I C K[xi, . . . , x n ] is an ideal, the following two subsets ofR k 
coincide: 

1. the intersection of all tropical hypersurfaces T [trop{f )) with f 6 I ; 

2. the closure in R fe of the set 

{(-val(y 1 ),...,-val(y k ))\(y 1 ,...,y k )GV(I)}cQ k . 

Proof. See Theorem 2.1]. □ 

For an ideal / C K[x%, . . . , x k \, we denote by T(I) C M. k the set mentioned 
in Theorem 12.11 It is called the tropical variety of the ideal /. 

Definition 2.2. IfT(I) C R fe is a tropical variety, we say that {/i, . . . , f r } is 
a tropical basis ofT(I) if and only if I = (f\, . . . , f r ) and 

T(I) - T{trop{h)) n • • • n T{trop(f r )). 

Remark 2.3. In general, a set of generators of an ideal I is not a tropical 
basis for T{I). Of course, the singleton {/} is a tropical basis for the tropical 
hypersurface T(trop(/)). 

We are mainly interested in the tropical variety T(I m>n ), where I m ,n is the 
ideal of the Grassmannian G(m, n) C Rvm) . To be more precise, we fix a 
polynomial ring 

Z[x] = Z[x ili2 ... ld 1 1 < i\ < i 2 < ■ ■ ■ < i m < n] 

in (^) variables with integer coefficients. The Pliicker ideal I m ,n is the prime 
ideal in Z[x], consisting of the algebraic relations among the determinants of 
the (m x 77i)-minors of any (m x n)-matrix with entries in a commutative ring. 
It is well-known that I m ,n is generated by quadrics (see for example [8]). 

The affine variety defined by I m ,n is the Grassmannian G(m,n) C M^" 1 ), 
which parameterizes all m-dimensional linear subspaces of an n-dimensional 
vector space. It has dimension (n — m)m + 1. 

Definition 2.4. The tropical variety T(I m ^ n ) is called a tropical Grassmannian 
and is denoted by Q m ,n- 

Theorem 2.5. The tropical Grassmannian Q m . n is a polyhedral fan in R^ m ). 
Each of its maximal cones has the same dimension, namely (n — m)m + 1. 
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Proof. See Corollary 3.1.]. □ 

Now we are going to fix our attention on the case m — 2. 

Example 2.6 (m = 2 and n = 4). The smallest non-zero Plucker ideal is the 
principal ideal I2.4 — (^12^34 — ^13^24 + £14^23)- Thus £/2,4 is a fan with three 
five-dimensional cones R 4 x R<o glued along M 4 . 

Theorem 2.7. TTie idea? 72,n *s generated by the quadratic polynomials 

Pijki '■= XjkXjl ~ x i3 x k i - XuXjk (1 < i < j < k < I < n). (3) 

These polynomials form the reduced Grobner basis if the underlined terms are 
leading. 

Proof. See [5J Theorem 3.1.7 and Proposition 3.7.4]. □ 

For each quadruple {i,j, k, 1} C {1, 2, . . . , n}, we consider the tropical poly- 
nomial 

trop^jw) = (xij ® x k i) © (a;,fe ® sCjj) © (a;« ® a;^). 

This polynomial defines a tropical hypersurface T(trop(pijki))- It turns out that 
the tropical Grassmannian G2.n is the intersection of these (?) hypersurfaces, 
so the quadrics pijki forms a tropical basis for Ii. n (see 0). 

Let D be an dissimilarity matrix on [n] and {i, j, fc, 1} C [71]. The maximum 
of the three numbers j) + D(fc, i), D(i, k) + D(j, I) and D(i, I) + D(j, k) is 
attained at least twice if and only if D S T(trop(pyfci)). Thus Theorem 11.11 
implies that a metric D on [n] is a tree metric if and only if D belongs to T n . 
In particular, one has the following result. 

Theorem 2.8. The space of trees T n is the tropical Grassmannian Qi,n- 

Proof. See Theorem 4.2] or the arguments above. □ 

Now we come back to the general case (so the case where to < n is arbitrary) . 
The ideal I m ,n is generated by quadratic polynomials, known as the Plucker 
relations. Among these are the three-term Plucker relations 

PR,ijkl '■= XRikXRjl — %Rij%Rkl ~ %RilXRjki 

which are closely related to (J3|). Hereby R is any (m — 2)-subset of [n] and 
i,j,k,l G [n] \ R. 

Definition 2.9. The three-term tropical Grassmannian T m n is the intersection 
%n,n-= P| T(trop(pn t ijki)) CRW. 

R,i,j,k,l 

In general, the three-term Pliicker relations do not generate I m ,n- If wi = 2, 
then S — and 72 jTl = Q2,n- For to > 3, the tropical Grassmannian Q m , n is 
contained in T m ^ n . This containment is proper for n > to + 4. 
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3 A description on the m-subtree weight map 

In this section, we are going to give an explicit description of a map 



sending the dissimilarity matrix D of a tree T to its m-dissimilarity map. 
Let -< be the order relation on N°° defined as follows. We have 



if and only if there exists anneN such that a, = hi for alii < n and a n < b n . 

Let T be a tree with n leaves. Let r be an inner node of T and consider T 
as a rooted tree (with root r). Let Af be the set of nodes of T. In particular, 
the set of leaves [n] — {1, . . . , n} is contained in Af. 

Lemma 3.1. There exists a map a : Af — > N°° such that the following properties 



1. a is injective. 

2. If n G Af is an ancestor of m G Af, we have a(m) y a(n). So the root r 
ofT gives rise to the minimum of {a(n)\n G Af}. 

3. If ni,n 2 G Af with n 2 not a descendant nor an ancestor of n\, m\ £ Af a 
descendant ofn\ andm 2 G Af a descendant ofn 2 , we have a(mi) -< a(m 2 ) 
if and only if a(n\) -< a(ri2)- 

Proof. We will define a inductively. Take a(r) = (0, 0, 0, . . .). For the induction 
step, if a(n) = (a\, . . . , a s , 0, 0, . . .) is defined for some n G Af with a s ^ and 
if mi, . . . , TO* are the children of n, take a(ni) = (ai, . . . ,a s ,i,0, . . .). Note that 
all the properties hold and that the depth of n G Af in T is equal to the number 
of non-zero entries in a(n). □ 

We say that the leaves of T are well-numbered if and only if a(i) -< a(j) for 
all i < j. 

A permutation a G S m of {1, ... , m} is called cyclic if and only if the decom- 
position of a into a product of disjoint cycles consists of only one cycle of order 
m. Denote the set of cyclic permutations in S m by C m . Note that a m — Id if 
a G C m . 

Theorem 3.2. Let n and m be integers such that n > m > 2. Let 




(a 1 ,a,2,a 3 , ...)-< (61,62,63, • • •) 



hold: 



6e i/ie map 



If D £ Qi, n C M^ 2 ) is i/ie dissimilarity matrix of an n-tree T, then the m- 
dissimilarity map ofT is equal to <ft m \D). So the set of m-dissimilarity maps 
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Proof. Write 



f(X; o~;ii, . . . , i m ) — X il ^ a{1) + X ia{1) : ^ 2(1) + . . . + X i ^ m _ 1 ^ iam{1) . 
Note that 

f(X; c; ■ ■ ■ ,i n (m)) = f(X;TraTr^ 1 ;i 1 , . . . ,i m ) 
for all 7r G S mi hence 

"^{/(^l^iirfi) »ir(m))}= mm {f (X ; a; i x , . . . , i m )} . (4) 

We have to prove that the weight D(i±, . . . , i m ) of the smallest subtree T" of 
T containing the leaves ii,...,i m is equal to | • mm ae c m {f(D; a; it, . . . , i m )}. 
It is enough to prove this for ii = 1, . . . , i m — m (the general case is proved 
completely analogously). By equation (U), we may also assume the leaves of T" 
are well-numbered. 

Let e = (x, y) be an edge of T" with y a child of x. We claim that for all 
it G C m , the weight w(e) of e is taken into account in at least two of the m terms 
of 

/(£>; a; 1, . . . , m) = D(l, a(l)) + £)((t(1), a 2 (l)) + . . . + D^-^l), 1) 
and in exactly two of the summands of 

f(D; t; 1, . . . , m) = D(l, 2) + £>(2, 3) + . . . + D(m, 1), 

where 



1 2 ... 771 — 1 m 

2 3 ... m 1 



GC„ 



Using this claim, we immediately see 



D{i x , ...,im) = \- f(D', r; 1, . . . ,m) = i ■ min {/(£>; <r; 1, . . . , m)}. 

To finish this theorem, we only need to prove the claim. Consider the split 
of T' induced by e and let T" be the component of the split containing y (hence 
T" is the maximal subtree of T' containing y but not x). Denote the set of 
leaves of T" by L" . We may assume 1 G L" (the case 1 L" is analogous). 
Note that in this case L" is of the form {1, . . . , s} for some s < m. 

The weight of e is taken into account in the term D(i,j) (i.e. the path 
between the leaves i and j of T" passes e) if and only if i G L" and j £ L" or 
vice versa. Thus w(e) is only counted in the two terms D(s, s + 1) and D(m, 1) 
of /(D;r; 1, . . . ,m). 

So it is enough to show that there exists aiG {0 . . . , m— 1} such that <7* (1) G 
L" and cr t+1 (l) ^ I" (the other case is proved analogously). If we assume this 
is not the case (so er*(l) G L" implies cr t+1 (l) G I"), we get L" = {1, . . . , m}, a 
contradiction. □ 
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Corollary 3.3. If D G Q2.11 C rW, we have that D(i±, . . . ,i m ) is equal to 

® (^(«l^a(l))®^(*<T(l),*<T 2 (l))®---®-D(V"-i(l)'V"(l))) 

Remark 3.4. In each component D(ii, . . . ,i m ), the minimum is attained at 
least twice. Indeed, assume the minimum is attained for a = r. Since 

f(D;r;h, ...,i m ) = f(D;r~ 1 ;i 1 , ...,i m ), 

the minimum is also attained for a = t _1 . Note that this could be use- 
ful for computations, since it permits us to consider only J-rp permutations. 
Furthermore, if {ij,ik} is a cherry of T", the minimum is also attained for 
a = (jk) o r o (jk), whereby (jk) is the transposition in S m switching j and fc. 

Remark 3.5. The map 0^™^ is not injective on the whole domain r( 2 ). For 
example, consider D, D' 6 r( 2 ), whereby D(i,j) = 1 for all 1 < i < j < n and 
D' only differs from D in the last coordinates, with D'(n — l,n) = 2. Clearly, 
one has D G £/ 2 ,n, £>' £ 02,™ and <?(>( m )(£>) = </><» (£)'). However, Theorem O 
implies that the restriction of i^ 1 ™) to <?2,n is injective if n > 2m — 1. 

Proposition 3.6. (m) (^2,«) C T m , n n^(rf)) 

Proof. The inclusion (m) (&,n) C <£( m )(R(2)) is obvious, while </> (m) (&!,«) C 
%n,n follows from [5]. For sake of completeness, we include the proof in this 
paper. 

Consider a tree T with leaf set [n] and distance matrix D. Let i? be an 
(to — 2)-subset of [n] and i, j, k, I £ [n] \ R. We have to prove that 

0("»)(D) G T(trop(p il , iiW )). 

Let [i?] be the smallest subtree of T containing the leaves in i? and let T" 
be the tree obtained from T by contracting [i?] to a point. Denote by etc. 
the images of respectively i,j, etc. in T'. Note that R 1 is a leaf of T". We have 

D(R,i,j)=D'(R',i',j')+D(R), 

hence 0( m )(I?) G r(trop(p R)ijW )) if and only if ^^(D') G T(trop( PiW ^)), 
where £)' is the distance matrix of T". 
Now Remark |4 . 1 1 below implies 

D'(R',i',j') = ±(D>(i>,f)+D'(i>,R')+D>(j',R% 

so cj)^(D') e T(tTop(p R > t i>j>k>i')) if and only if D' G T(trop(p^ fc ^))- Hence 
the statement follows from Theorem ll.il □ 
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4 The 3-dissimilarity maps of trees 

Denote the coordinates of rW by X(i,j) (here we index over all integers i,j 
with 1 < i < j < n) and the coordinates of by X(i,j,k) (here we index 
over all integers i,j, k with l<i<j<k<n). Recall that if D G C?2,n is a 
tree, D(i,j) is the distance between leaf i and leaf j. 

Remark 4.1. Since C3 = {<7i,(72} with 

cn=Q 3 f) and a 2 = (a,)- 1 = Q J ^ , 
i/ie map 0( 3 ) sends X = (X(i,j))i.j to (X(i,j,k))ij t k with 

X(i, j, fc) = i ■ j) + X(i, k) + X(j, k)). 

So if D G ^2,n; £Ae 3-subtree weights of the tree D are given by D(i,j,k) — 
±-(D(i,j) + D(i,k)+D(j,k)). 

The following results states that for the case m = 3 the equality holds in 
Proposition 13.61 if n > 5. 

Proposition 4.2. Ifn>5, we have </> (3) (£ 2: „) = %,n H (3) (M©) 

Proof. By Proposition 13 -61 it is enough to show that for a general point P G 
0(3) n 7^ ni there exists a point P S C/ 2 ,n such that (3) (D) = P. Since 

P 6 0< 3 )(]r(2)), there exists a point D G k(=) such that </>( 3 )(L>) = P. It suffices 
to prove that D G C?2,n- In order to do this, we show that in each triplet 

{D(i, j) + D(k, I), D{i, k) + D(j, l),D(i, k) + D(j, k)}, 

the maximum is attained at least twice. Fix S G [n] \ {i,j,k,l} (n > 5). Since 
P £ %,m i n the triplet 

{P(S, i, j) + P(S, k, l),P(S, i,k) + P(S, j, I), P(S, 1, 1) + P(S, j, k)}, 

the maximum is attained at least twice. Note that 

P(S, + P(S, k,l) = ^(C + D(i, j) + D(k, 0), 

P(S, i, k) + P(S, 3, 1) = \{C + D(i, k) + D(j, I)), 

P(S, i, I) + P(S, 3, k) = i(C + D(i, k) + D(j, k)), 

where C = D(S,i) + D(S,j) + D(S,k) + D(S,l). Hence the maximum in 
{D(i,j) + D(k,l),D(i,k) + D(j,l),D(i,k) + D(j,k)} is also attained at least 
twice, thus D G G 2 , n and P G (3) □ 
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For the proof of the proposition below, we need an extra definition. 

Definition 4.3. An ultrametric D on [n] is a metric which satisfies the following 
strengthened version of the triangle inequality: 

Vi,j,k£ [n] : D(i,j) < max{D(i,k),D(j,k)}. 

Equivalently, at least two of the three terms D(i,j), D(i, k), D(j, k) are the same. 

Remark 4.4. In general, the dissimilarity matrix D of a tree T is not an ultra- 
metric. In case D G Q2,n is an ultrametric, we can realize D by an equidistant 
tree, i.e. a rooted tree such that the distance F between the root and each leaf 
is equal. In particular, 2F — max{D(i, j) \ i,j £ X and i ^ j}. 

Proposition 4.5. </> (3) (£ 2 ,n) C G 3 ,n 

Proof. Let T be a tree with 3-dissimilarity map 

P = (D(i,j,k)) iijtk = ^) ((D{h])t j) e 00) ( &in ) c rG). 

If M G K 3xn , we denote the (3 x 3)-minor with columns i, j, k by M(i, j, k). By 
Theorem 12.11 Q^^ n is the closure in ftW of the set 

S := {(-val(det(M(i,i,fc)))) iii)fe |M e if 3x "} C Qft). 

Assume first that all the edges of T have rational weights, a fortiori P £ qW. 
We are going to show there exists a matrix M € K 3xn such that 

D(i,j,k) = -val(det(M(i,i,fe))). 

Fix a rational number E with E > D(i,n) for all i £ {1, . . . ,n — 1} and 
define a new metric D' by 

j) = 2E + D(i, j) - D(i, n) - D(j, n) 

for all different i,j € [n] (in particular, D'(i,n) = 2E for i ^ n). Note that 
D' G G2.11 and that D' an ultrametric on {1, . . . , n — 1}, so it can be realized by 
an equidistant (n — l)-tree T" with root r. Each edge e of T" has a well-defined 
height h(e), which is the distance from the top node of e to each leaf below e. 
Pick a random rational number a(e) and associate the label a(e)t 2h ^ to e. If 

i 6 {1 n- l}isa leaf of T", define the polynomial Xi(t) by adding the labels 

of all edges between r and i. It is easy to see that D'(i,j) = deg(xj(t) — Xi(t)) 
for all i,j G {1, . . . ,n — 1}. 

Denote the distance from r to each edge by F. Since 

2F = max{D'(i,j) 1 1 < i < j < n - 1} < 2E, 

we have F < E. The metric D' on [n] can be realized by a tree T', where T 1 is 
the tree obtained from T" by adding the leave n together with an edge (r, n) of 
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length 2E — F. If we define x n (t) = t 2E , we get that D'(i, j) = deg(xj (t) — Xi(t)) 
for all i,j G [n]. 

Now consider the matrix 



M' 



1 1 1 ... 1 

Xi(t) x 2 (t) x 3 (t) ... x n (t) 
Xl{t) 2 x 2 {tf x 3 (t) 2 ... x n (t) 2 



We have det(M'(i,j,k)) = (xj(t) — Xi(t))(xf.(t) — Xi(t))(xk(t) — %j(t))i hence 

D'(i,j) +D'{i,k) + D'(j, k) = deg(det(M'(i, j, *))). 

Let M be the matrix obtained from M' by multiplying, for each i, the «-th 
column of W by (^(^«)-^)2 Since 

D(i,j) = D'(i,j) + (D(i,n)-E) + (D(j,n)-E) 

= dcg • iP{j,^)-E . _ ( 

we get that D(i,j) + D(i,k) + D(j, k) = deg(det(M(i, j, fc))). If we replace each 
i in M by t^ 1 / 2 , we get 

D(i,j,k) = -val(det(M(*,j,fc))). 

Now assume T has irrational edge weights. We can approximate T arbitrarily 
close by a tree T with rational edge weights. From the arguments above, it 
follows that the 3-dissimilarity map D of T belongs to S, hence D 6 ^3 jT i. □ 

Theorem 4.6. J/n > 5, we have (£ 2 ,n) = (3) (^' 3 *)) H £ 3 ,„. 

Proof. The statement follows from Proposition ^. 21 Proposition ^. 51 and the fact 
that £ 3 , n C T 3 ,„. □ 



5 The 4-dissimilarity maps of trees 

In this section, we give a geometric description of cf>^ (<?2,n)- 
Remark 5.1. The set C4 = {o~\, ^ , (T2, cr^ 1 , (73, c^ 1 } with 

_ A 2 3 4\ _ A 2 3 4\ _ A 2 3 4\ 
CTl -(,2 3 4 lJ' CT2 -(,2 4 1 sj'^-l^ 4 2 l) ' 

Hence the map tp^ sends (X(i,j))ij to (X(i,j,k,l))i t j t k,i where X(i,j,k,l) is 
equal to the minimum of the three terms 

X(l, 2) + X(2, 3) + X(3, 4) + X(A, 1), 
X(l, 2) + X(2, 4) + X(A, 3) + X(3, 1), 
X(l, 3) + X(3, 2) + X(2, 4) + X(A, 1), 

divided by two. 
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Consider M = and take X(i,j;k,l), with {i,j, k,l} C [n] a 

quadruple, as coordinates on M. For example, X(j,i;l,k) = X(i,j;k,l), but 
X{i,k;j,l) ^ X(i,j;k,l) and X(k,l;i,j) ^ X(i,j;k,l). 

Let TT : R(2) -► M : j))^- h-» (X(i,j;k,l)) i:jM with 

j; fc, = i ■ (X(i,j) + X(k, I) + mm{X(i, k) + X(j, I), X(i, I) + X(j, k)}). 

Let L be the linear subspace of M consisting of points X(i,j;k,l) with 
X{i,j;k,l) = X(i,k;j,l) = X(i,l;j,k) = X(j,l;i,k) = X(j,k;i,l) = X(k,l;i,j) 

for all different k, I G [n]. Points in L can be projected naturally to by 
sending X(i, j; k, I) to X(i,j, k, I). Denote this projection by p. 

Proposition 5.2. ^ (G 2 ,n) = p(tt(r(S)) n L). 

Proof. Note that for any real numbers a, 6, c, we have 

a + min{fe, c} — b + min{a, c} = c + min{a, b} (5) 

if and only if max{a, 6, c} is attained at least twice. If the latter holds, the terms 
in ([5]) are equal to min{a + b,a + c,b + c}. 

If we take a = X(i,j)+X(k, I), b = X(i, k)+X(j, I) and c = X(i, l)+X(j, k), 
the statement follows from the Tree Metric Theorem. □ 
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